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Abstract. A regular language L is said to be cellular if there exists 
a 1-dimensional cellular automaton CA such that L is the language 
consisting of the finite blocks associated with CA. It is shown that 
cellularity of a regular language is decidable using a new characterization 
of cellular languages formulated by Freiling, Goldstein and Moews and 
implied by a deep result of Boyle in symbolic dynamics. 

1. Introduction 

A 1-dimensional cellular automaton CA of radius r consists of a finite 
alphabet V and a local rule p : r 2r+1 — > T. The local rule p extends to 
a function p^ : T z — > T z as follows: for c € T z and i € Z, p{c)oo{i) = 
p(ci- r , . . . , Cj, . . . , Ci +r ). There is a vast literature about cellular automata; 
they have been used to model a wide variety of physical and mathematical 
phenomena. One aspect of the study of 1-dimensional cellular automata 
involves comparing properties of CA with those of a certain regular language, 
L(CA), which is now described. A finite block of Poo( c ) is a finite sequence 
of contiguous entries in Poo(c), of the form p 00 (c)(m), . . . ,Poo(c)(m + k), 
where m £ Z and k £ N U {0}. As the range of p^ is shift invariant, 
we may replace m by in the definition of finite block. Let L{CA) be 
{Poo (c)(0), . . . ,poo(c)(/c) : c £ T z , k G N U {0}} U {e} (where e is the empty 
word). It is well known that L(CA) is accepted by a non-deterministic 
automaton whose associated undirected graph is a De Bruijn graph. The 
reader may refer to Sutner's work [5] and [6] for further information on 1- 
dimensional cellular automata and regular languages. Kari |3] presents a 
broad range of topics and problems connected with cellular automata. 

A regular language L is said to be cellular if there exists a 1-dimensional 
cellular automaton CA such that L = L(CA). Klaus Sutner has asked 
whether cellularity is decidable in the regular languages, and his question is 
answered in the affirmative here. 

Theorem 1.1. Cellularity is decidable. 
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The algorithm constructed here depends on a very useful and apparently 
new characterization of cellular languages. The first author learned this 
characterization, given in Corollary [L2l from Freiling, Goldstein and Moews 
who had proved it in terms of cellular languages. They later realized that 
it is implied by a deep result of Boyle pQ in symbolic dynamics concerning 
mixing of sofic shifts. Notation and terminology necessary for the statement 
of Corollary 11.21 are given next. 

All languages are assumed to be over a finite, non-empty alphabet T. 
As is customary we use T* to denote all the finite word over T, including 
the empty word e. Because cellular languages are non-empty and properly 
contain {e}, all languages here will be assumed to have those properties. 
Since V* is cellular, it will also be assumed that L is properly contained in 

Definition 1.2. Let L be a language over a finite alphabet T. Four properties 
are defined. 

(1) (LI) (Factorial) Lf x,y £ T* and xy £ L, then x,y £ L. 

(2) (L2) (Receptive) There exists a letter o £ T, such that for allx,y £ L, 
there exist a,b £ L such that for all n £ N, xao n by £ L. 

(3) (L3) (Transitive) Lf x,y £ L, there exists u £ L such that xuy E L. 

(4) (L4) (Prolongable) If u E L, then there exist s,t £ L — {e} such that 
sut £ L. 

It is not difficult to verify that (LI) through (L4) are satisfied by cellular 
languages. (LI), (L3), and (L4) are familiar to those who work in cellular 
automata and symbolic dynamics; (L2) is a condition that arises naturally 
from Boyle's Theorem 3.3 [lj. Note that (L2) implies both (L3) and (L4). 

Corollary 1.3. (Freiling- Goldstein- Moews [2], Boyle \Q) A regular language 
L over an alphabet V is cellular if and only if L satisfies (LI) and (L2). 

Some results and notation from basic automata theory are briefly re- 
viewed. The reader may refer to [I] for further detail. Let L be a regular 
language over the finite language V with minimal automaton M(L) having 
states Q. Let Q acc be the set of accepting states of M(L), and let qi n i be 
the initial state of M(L). Because M(L) is the minimal automaton for L, 
all states in Q are reachable from qi n i\ that is, there exists a £ T* such that 
Qini-a = q. 

If L is presented by a DFA F, then M{L) can be determined in polynomial 
time (with instance size the number of states of F). If a regular language L 
is presented as a regular expression £, or by an NFA N, then M(L) can be 
determined in EXPSPACE from £ and N, respectively. 

For q £ Q, let L q be the language determined by the automaton having 
the same transition function and final states of M(L), but with q (rather 
than qi n i) as the initial state. Let |M(L)| be the number of states of M(L). 
In the proof of Lemma 12.11 use will be made of the fact that the state set 
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Q of M(L) can be taken to be {w/ ~l: w G T*}, where ~l= {(u,v) G 
T* : Vw G r*, uw G L o fro G L}. Note that ~l is a right congruence 
of finite index of the free monoid T* , where the action of T on Q is given 
by wj ~l~ >• w.c/ where c G T. The final states are those ~l classes 
contained in L; the initial state is e/ where e is the empty word. 

Given an NFA N, let \N\ be the number of its states. For NFAs A and B, 
it is well known that determining whether L(A) C L(-B) (and thus whether 
L(A) = L{B)) is in PSPACE with respect to an instance size of 0(\A\ + \B\). 

2. Proof of Theorem 11.11 

In the folklore result below, regular languages that satisfy (LI) are charac- 
terized in terms of two conditions on their minimal automata; a deterministic 
automaton satisfying the first condition below is sometimes referred to as a 
sink automaton. 

Lemma 2.1. If L ^ T* , then L satisfies (LI) if and only if the minimal 
automaton M(L) of L satisfies: 

(1) Among the states Q of M{L), there is a unique non-accepting state 
g; moreover g is a sink. 

(2) If q G Q — {g}, the language L q is contained in L. 

Proof. Suppose L ^ T* and L satisfies (LI). Let u,v G T*. \iu^L and 
v G" L, then for all c G T* , by (LI), we have both that uc L and vc G" L. 
By the definition of we have u and v are in the same class of It 
follows that there is only one non-accepting state, the class u/ where u 
is any word not in L. Denote the non-accepting state by g. By (LI) again, 
it follows that g is fixed by each element of T; that is, g is a sink. 

Suppose q G Q — {g}. It is shown that L q C L. Since M(L) is the minimal 
automaton for L, q is reachable from c/j n j. So suppose d G T* is such that 
Qini-d = q- Let c G L q . Then q.c / g, and qmi-dc / g, from which it follows 
that dc G L. Now (LI) guarantees that c G L. 

Conversely, assume L/T* and the two conditions above hold for L and 
M{L). Suppose uv G L, where u,v G T*. That M{L) is a sink automaton 
implies that qini-u G Q acc ; that is, u G L. Now using that L u C L, it follows 
that v G L. □ 

Given M(L), determining if L satisfies (LI) amounts to checking that 
the non-accepting state is a sink, and for each q G (5 acc checking that L q is 
contained in L. This last test is in PSPACE with respect to instance size 
0{\M{L)\). Also, given M(L), it is obviously in P to check that the single 
non-accepting state is a sink. The following has been proved. For a regular 
expression £, let \£\ be the number of symbols in E. 
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Lemma 2.2. Given a regular language L presented by a regular expres- 
sion £ or as a non- deterministic automaton N , then the problem with ques- 
tion"Does L satisfies (LI)?" is decidable; indeed, it is in EXPSPACE with 
respect to \N\ or \£\. 

Proof of Theorem II. 1L To prove Theorem 11.14 h suffices to show that 
given a regular language L satisfying (LI) and its minimal automaton M(L), 
whether (L2) is satisfied can be determined algorithmically. Of course to 
show that (L2) is decidable, it suffices to show that given o G T, it can be 
decided whether (L2) with o is satisfied. For the remainder of the proof, o 
will be a fixed element of T. From Lemma |2.1| it can be assumed M(L) is 
a sink automaton, and all states of M(L) are reachable from g^j. 

Definition 2.3. Let q G Q acc be such that for all n £ N, q.o n is in Q acc , 
and let P= q.o* = {q.o n : n G N U {0}}; P will be treated as a simple path 
that begins with q, and P will be said to be an o-path. 
Of course, there are no more than \M(L)\ o-paths. 

Definition 2.4. Let q G Q acc , and let P(o, q) be the set of all o-paths P for 
which there exists a directed path from q to the first state of P. 

After some motivation, for q G Q acc , a regular language L(o, q) will be 
defined: Suppose that L satisfies (L2) with o. Let x,y be contained in L. 
Since L satisfies (LI) and (L2), independent of n there exist a,b G L such 
that xao n by G L. Let q = qi n %-x G Q acc . Since L satisfies (LI), for all n G N, 
we have q.ao n G Q acc . Thus P = q.ao* is an o-path. Indeed, P is in P(o,q). 
From xao n by G L, it follows that for all p G P, the regular language L p f, 
contains y. That is, y G n p& pL p ^. Thus, if L satisfies (L2) with o, then for 
all q G Q acc and all y G L, there exists an o-path P in P(o, q) and a word 
b G L such that y G H pg p L p jy. 

Ignoring q for the time being, notice for a given y G L and o-path P, 
to check whether such a b G L exists, it suffices to check only those b G L 
having length no more than \M(L)\\ M ( L ^; this is because b only matters here 
up to its action on M(L). Related to this observation, and following for the 
same reason, if L satisfies (L2), given x,y G L, independent of n, there exist 
a,b G L such that \a\, \b\ < \M(L)\\ M ^\ and xao n by G L. 

Definition 2.5. We introduce the following notation for the sake of brevity. 

(1) LetK(o,q)=P{o,q) x (r(l M ( L )l |M(i)l )). 

(2) Let L(o,q) = ^l(P : b)eK(o,q)(<^ P epL p .b)- 

Note that since L satisfies (LI), it follows that L(o,q) is contained in L, 
and since L(o, q) is a finite Boolean combination of regular languages, L(o, q) 
is regular. For any q G Q acc , if L(o, q) is empty, observe that (L2) can not 
be satisfied with o. So it is harmless to assume L(o, q) is non-empty. 

Proposition 2.6. L satisfies (L2) with o if and only if for all q G Q acc , we 
have L(o, q) = L. 
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Proof. Suppose that for all q G Q acc we have L = L(o, q). Let x, y be in 
L and let q = qmi-x. Since y G L = L(o,q), there exists P € P(o,q), b £ L 
with |6| < |M(L)|'M(L)| such that y G n pe pL p . b . Let pi be the first state 
of P. Since P G P (o, </), there exists a G L such that g.a = p\. But now we 
have for n G N that qi n i.xao n by G Q acc ; equivalently, we have xao n by G L. 
./Vote i/wi a, 6 are 5oi/i independent of n. 

Conversely, suppose that L satisfies (L2) with o. Let q G Q acc . We need 
to show that L = L(o,q). Since M(L) is the minimal automaton of L, 
there exists x G L such that q = qini-x. Select arbitrary y G L. We will 
show that y G L(o,q). Since L satisfies (L2) with o, there exist a, b G L 
such that for any n G N we have xao n by G L. Moreover, we can assume 
that \b\ < \M(L)\\ M{L ^. Since L satisfies (LI), we have for all n G N that 
xao 11 G L; thus, qi n i.xao n G Q acc , and q.ao* is an o-path P having first state 
q.a. Moreover, since P is reachable from q — Qini- 

x, we have P G P(o,q). 

But now xao n by G L implies that y G n pG pL p .6, where P G P(o, g) and 
\b\ < \M(L)\\ M{ - L )\. That is, y G P(o,g). □ 

Completion of proof of Theorem 11.11 To prove the theorem, it suf- 
fices to show for a given q G Q acc that determining whether L = L(o, q) 
is decidable. Using the determinism of M(L), the collection P(o, q) can 
be constructed algorithmically. Decidability is proven if the Boolean com- 
bination of languages of the form L p b can be constructed algorithmically. 
We can algorithmically form a non-deterministic automaton N such that 
L(N) = L(o,q), where \N\ is bounded above by a recursive function in 
|M(L)|: Letting a = \M(L)\, by examining the parts of the Boolean ex- 
pression determining L(o, q), and using the standard constructions for non- 
deterministic machines that represent unions and intersections of regular 
languages, we can bound \N\ as follows. Let a = \M(L)\. Observe that 
\K{o, q)\ < a(a a ), and that for (P, 6) G K(o,q), we can determine n pe pL p .6 
with a non-deterministic automaton having no more than a 2 states. Thus 
to construct an NFA that accepts L(o,q), we need no more than 2(a)a a a 2 
states. It follows now that determining whether L is equal to L(o, q) is 
decidable, completing the proof of Theorem 11.11 □ 
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